Abstract

We study the logistic map $f(x) = \lambda x(1 - x)$ on the unit interval at
the chaos threshold. Using the methods of symbolic dynamics, the
information content of an orbit of a dynamical system is defined as
the Algorithmic Information Content (AIC) of a symbolic sequence.
We give results for the behaviour of the AIC for the logistic map.
Since the AIC is not a computable function, we approximate it by
the length of the string after it has been compressed by a compression
algorithm, and in particular we introduce a new compression algorithm
called CASToRe. The information content is then used to characterise
the chaotic behaviour.



1 Introduction 

Since the discovery of unpredictability in deterministic systems, which led to the
study of chaotic dynamical systems, much work has been done to find properties
of chaos that could give a classification of these systems. In particular,
indicators such as the Lyapounov exponent, the Kolmogorov-Sinai entropy, the
topological entropy and others have been developed.

Nevertheless, in recent years dynamical systems have been found
for which none of the known indicators reveals the presence of chaos, although
numerical results show a high degree of unpredictability for the orbits of these
systems. This phenomenon has led to attempts to generalize the known
indicators (for example, see [?]), but has also stimulated the search for
new properties of these systems.

One of the possible new approaches to these weakly chaotic dynamical
systems is related to information theory, that is, to the notion of the information
content of a symbolic string. In Section 2 we give the definition of information
content and show how to apply this technique to dynamical systems.

In particular we study the dynamics of the logistic map (see equation
(3)) at the chaos threshold, which is one of the best known examples
of weak chaos. In Section 3 we give the theoretical results that allow a
classification of our map among the weakly chaotic dynamical systems.

Our theoretical approach is based on the notion of Algorithmic Information
Content introduced by Kolmogorov and Chaitin (see Section 2).
Unfortunately this notion of information is not computable, in the sense that
no algorithm can exist that computes this function. Hence, to confirm our
results numerically, we have to use an approximate notion of information
content. In Section 4 we present a possible approach to this problem,
which consists in using compression algorithms to obtain an estimate
of the information contained in a string. In particular a new compression
algorithm, called CASToRe, is presented and some of its properties are
illustrated. Finally, in Section 5 we apply CASToRe to the logistic map at the
chaos threshold to confirm the theoretical predictions for the behaviour of
the Algorithmic Information Content of the strings generated by the map.



2 Information of symbolic sequences 

In our approach the basic notion is the information content of a symbolic
sequence. Given a finite string $s^n$ of length $n$, that is, a finite sequence of
symbols $s_i$, $i = 1, \dots, n$, taken from a given alphabet, the intuitive meaning of the
quantity of information $I(s^n)$ contained in $s^n$ is the following:

$I(s^n)$ is the length of the smallest binary message from which you can
reconstruct $s^n$.

Formally, this notion leads to the definition of the Algorithmic Information
Content (AIC) (or Kolmogorov complexity) introduced by Chaitin ([?])
and Kolmogorov ([13]). In order to define it, it is necessary to define the
partial recursive functions. We limit ourselves to giving an intuitive idea which
is very close to the formal definition. We can consider a partial recursive
function as a computer $C$ which takes a program $p$ (namely a binary string)
as an input, performs some computations and gives a string $s = C(p)$, written
in the given alphabet, as an output. The AIC of a string $s$ is defined as
the length of the shortest binary program $p$ which gives $s$ as its output, that is

$I_{AIC}(s, C) = \min\{|p| : C(p) = s\}$, (1)

where $|p|$ means the length of the string $p$. A computing machine is called
universal (namely it is a computer in the usual sense of the word) if it can
simulate any other machine (for a precise definition see any book on recursion).
In particular, if $C$ and $C'$ are universal then $I_{AIC}(s, C) \le I_{AIC}(s, C') + \text{const}$,
where the constant depends only on $C$ and $C'$. This implies that, if $C$ is
universal, the complexity of $s$ with respect to $C$ depends only on $s$ up to
a fixed constant, and so its asymptotic behaviour does not depend on the
choice of $C$. Thus, we will just write $I_{AIC}(s)$ in equation (1) when referring
to the AIC of the string $s$.

We now extend the concept of information to strings generated by a
dynamical system $(X, f)$. Using the usual procedure of symbolic dynamics,
given a partition $\alpha$ of the phase space of the dynamical system $(X, f)$, it
is possible to associate a string $\Phi_\alpha(x)$ to the orbit having $x \in X$ as initial
condition. If $\alpha = \{A_1, \dots, A_l\}$, then $\Phi_\alpha(x) = (s_0, s_1, \dots, s_k, \dots)$ if and only
if

$f^k x \in A_{s_k} \quad \forall k$,

with $s_k \in \{1, \dots, l\}$ $\forall k$. The AIC of a single orbit of the dynamical system
is then the AIC of the symbolic string generated by the orbit. To remove
the dependence on the partition $\alpha$ of the phase space, one has
to take a supremum over the partitions.
The first results were obtained by Brudno ([?]) using open covers of
the phase space. Another possible approach, using computable partitions (footnote 1),
is introduced in [?]. We do not address this problem here; we will just
assume that the partitions we consider are generating, in the sense that they
give the best approximation for the AIC of an orbit of our dynamical system.

Footnote 1: Intuitively, a computable partition is a partition that can be recognised by a
computer.
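The coding of an orbit into a symbolic string can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `logistic` and `symbolic_orbit` are hypothetical, the partition is assumed to be a partition of an interval into consecutive cells described by its interior endpoints, and the cells are numbered from 0 rather than from 1.

```python
from bisect import bisect_left

LAMBDA_INF = 3.569945671870944901842  # chaos threshold of the logistic map


def logistic(x, lam=LAMBDA_INF):
    """One step of the logistic map f(x) = lam * x * (1 - x)."""
    return lam * x * (1.0 - x)


def symbolic_orbit(f, x0, cuts, n):
    """Return the first n symbols of the orbit of x0 under f.

    `cuts` is the sorted list of interior endpoints of the partition.
    Symbol k means that the point lies in the k-th cell (0-based).
    A point equal to a cut is assigned to the cell on its left.
    """
    symbols = []
    x = x0
    for _ in range(n):
        symbols.append(bisect_left(cuts, x))
        x = f(x)
    return symbols


# Orbit of x = 1/2 coded with the two cells [0, 1/2] and (1/2, 1].
print(symbolic_orbit(logistic, 0.5, [0.5], 9))
```

The string produced this way is the object whose information content is studied in the rest of the paper.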
There exist some results connecting the information content of a string
generated by a dynamical system and the Kolmogorov-Sinai entropy $h^{KS}$ of
the system.

First of all, it is proved that in a dynamical system with an ergodic
invariant measure $\mu$ with positive K-S entropy $h_\mu^{KS}$, the AIC of a string $n$
symbols long behaves like $I_{AIC}(s^n) \sim h_\mu^{KS}\, n$ for almost any initial condition
with respect to the measure $\mu$ ([?]).

Instead, in a periodic dynamical system, we expect to find $I_{AIC}(s^n) =
O(\log n)$. Indeed, the shortest program that outputs the string $s^n$ would
contain only information on the period of the string and on its length.

It is also possible to have intermediate cases, in which the K-S entropy is
null for all the invariant measures that are physically relevant and the system
is not periodic. These systems, whose behaviour has been called weak chaos,
are an important challenge for research on dynamical systems. Indeed, no
information is given by the classical properties, such as the K-S entropy or the
Lyapounov exponents, and in recent years some generalized definitions of the
entropy of a system have been introduced to characterize the behaviour of
such systems (for example, see [15]). We believe that an approach to weakly
chaotic systems using the order of growth of their AIC could be a powerful
way to classify these systems. This approach has already been used for the
Manneville map ([?])

$f(x) = x + x^z \pmod 1$, $x \in [0, 1]$, $z > 1$.

In the cited papers it is proved that the AIC of the Manneville map behaves,
for values $z > 2$ of its parameter, as

$I_{AIC}(s^n) \sim n^\alpha$, $\alpha = \frac{1}{z - 1} < 1$, (2)

and this behaviour has been called sporadicity.

Among the weakly chaotic systems, one can identify behaviours different 
from the sporadic one. As an example, the AIC can have order smaller than 
any power law (with respect to the length of the encoded string). We choose 
to call this behaviour mild chaos. 

In this paper we show that the AIC of the logistic map at the chaos
threshold is of order $\log n$ (Theorem 3.10 and Section 5). We thus
prove that the weakly chaotic dynamics of the logistic map at the chaos
threshold is in particular mildly chaotic.
3 The logistic map from the information point of view



We now apply the theory of the Algorithmic Information Content of an orbit 
of a dynamical system to the logistic map at the chaos threshold. We first 
give some well known results on the dynamics of the map, and then use these 
results to obtain an estimate for the AIC. 

3.1 The dynamics of the logistic map 

The logistic map, defined by

$f_\lambda(x) = \lambda x(1 - x)$, $x \in [0, 1]$, $1 < \lambda < 4$, (3)

is a very simple example of a map with an extremely complicated behaviour
for some values of the parameter $\lambda$. The dynamics of the map has been
studied extensively, and there are many important results that have been
generalized to one-dimensional dynamical systems. Here we give a brief
description of the well known period doubling sequence, and recall some
results for the dynamics at the chaos threshold. The references for the first
part of this section are [?], [?], [?].

Consider first $\lambda \in (1, 3)$. There are two fixed points, $x_0 = 0$ and $x_1 =
(\lambda - 1)/\lambda$. The fixed point at the origin is unstable and the point $x_1$ is stable.
Every orbit starting in $(0, 1)$ then converges to the fixed point $x_1$.

When $\lambda = \lambda_1 = 3$, we have the first bifurcation. The derivative of $f$ at
the point $x_1$ is $-1$, and the fixed point is now neutrally stable. Moreover, one
periodic orbit of period 2 is generated and it becomes stable as soon as $\lambda$
becomes greater than $\lambda_1$. This is a period doubling bifurcation.

This kind of bifurcation repeats over and over at different values $\lambda_n$ of the
parameter. That is, when $\lambda \in (\lambda_n, \lambda_{n+1})$, there is one stable periodic orbit
of period $2^n$, and unstable periodic orbits of period $2^j$, $j = 0, \dots, n - 1$. When
$\lambda = \lambda_{n+1}$, the stable periodic orbit loses its stability and a new periodic orbit
of period $2^{n+1}$ is generated.
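The period doubling sequence can be observed numerically. The following Python sketch is only an illustration: the function name `attractor_period`, the initial point, the transient length and the tolerance are all assumptions chosen for this example, and the method only detects attracting periodic orbits of moderate period.

```python
def attractor_period(lam, x0=0.3, transient=100_000, max_period=64, tol=1e-9):
    """Estimate the period of the attracting orbit of the logistic map.

    The orbit of x0 is iterated for `transient` steps, then the first
    return time to within `tol` of the reached point is reported.
    Returns None if no return is seen within `max_period` steps.
    """
    x = x0
    for _ in range(transient):
        x = lam * x * (1.0 - x)
    reference = x
    for period in range(1, max_period + 1):
        x = lam * x * (1.0 - x)
        if abs(x - reference) < tol:
            return period
    return None


# One parameter value before the first bifurcation and two after it.
for lam in (2.8, 3.2, 3.5):
    print(lam, attractor_period(lam))
```

For the three sample values the detected periods are 1, 2 and 4, in agreement with the first steps of the bifurcation sequence.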

One of the main features of this bifurcation sequence is that the bifurcation
parameters $\lambda_n$ accumulate at $\lambda = \lambda_\infty$, where

$\lambda_\infty = 3.569945671870944901842\ldots$ (4)

Moreover, it holds

$\lambda_n \approx \lambda_\infty - \frac{b}{\delta^n}$, (5)

where $\delta$ is the so-called Feigenbaum constant and $b$ is a suitable constant
([?]).

The behaviour of the logistic map at the chaos threshold has attracted 
much attention, in particular for being a map with null Kolmogorov entropy 
for any invariant probability measure, and nevertheless showing a weakly 
chaotic behaviour. 

We recall that in this paper by weak chaos we mean the behaviour of a 
map with null Kolmogorov entropy for all the physically relevant invariant 
measures and with a dynamics that is neither periodic nor regular in some 
sense. As we have seen in Section 2, a way to classify weakly chaotic maps
is given by the Algorithmic Information Content (AIC) of a string generated
by the map.

Let us now consider the map $f_{\lambda_\infty}$. For this map there are countably many
unstable periodic orbits of periods $2^j$, for all $j \in \mathbb{N}$, and an attractor $\Omega$ that
is a Cantor set, the so-called Feigenbaum attractor. The following
theorem holds ([?], Theorem III.3.5):

Theorem 3.1. The logistic map $f_{\lambda_\infty}$ at the chaos threshold has an invariant
Cantor set $\Omega$.

(1) There is a decreasing chain of closed subsets

$J^{(0)} \supset J^{(1)} \supset J^{(2)} \supset \dots$,

each of which contains 1/2, and each of which is mapped onto itself by $f_{\lambda_\infty}$.

(2) Each $J^{(i)}$ is a disjoint union of $2^i$ closed intervals. $J^{(i+1)}$ is constructed by
deleting an open subinterval from the middle of each of the intervals making
up $J^{(i)}$.

(3) $f_{\lambda_\infty}$ maps each of the intervals making up $J^{(i)}$ onto another one; the
induced action on the set of intervals is a cyclic permutation of order $2^i$.

(4) $\Omega = \bigcap_i J^{(i)}$. $f_{\lambda_\infty}$ maps $\Omega$ onto itself in a one-to-one fashion. Every orbit
in $\Omega$ is dense in $\Omega$.

(5) For each $k \in \mathbb{N}$, $f_{\lambda_\infty}$ has exactly one periodic orbit of period $2^k$. This
periodic orbit is repelling and does not belong to $J^{(k+1)}$. Moreover this periodic
orbit belongs to $J^{(k)} \setminus J^{(k+1)}$, and each point of the orbit belongs to one of the
intervals of $J^{(k)}$.

(6) Every orbit of $f_{\lambda_\infty}$ either lands after a finite number of steps exactly on
one of the periodic orbits enumerated in (5), or converges to the Cantor set $\Omega$
in the sense that, for each $k$, it is eventually contained in $J^{(k)}$. There are
only countably many orbits of the first type.

A characteristic of chaotic dynamical systems is the sensitivity to initial
conditions. Roughly speaking, a system has sensitive
dependence on initial conditions if two orbits that start close to each other diverge.

It is well known that the Kolmogorov-Sinai entropy is related to the
sensitivity to initial conditions of the orbits. The exact relation between the
K-S entropy and the instability of the system is given by the Ruelle-Pesin
theorem. We recall this theorem in the one-dimensional case. Suppose
that the average rate of separation of orbits starting nearby is exponential,
namely

$\Delta x(n) \simeq \Delta x(0)\, 2^{\lambda n}$ for $\Delta x(0) \ll 1$,

where $\Delta x(n)$ denotes the distance between the two points at time $n$. If the
Lyapounov exponent $\lambda$ is positive then the system is unstable and $\lambda$ can be
considered a measure of its instability (or sensitivity with respect to the initial
conditions). The Ruelle-Pesin theorem implies that, under some regularity
assumptions, $\lambda$ equals the K-S entropy.
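As an illustration, the Lyapounov exponent of the logistic map can be estimated as the time average of $\log_2 |f'(x)|$ along an orbit; the base 2 matches the factor $2^{\lambda n}$ used above. The sketch below is not taken from the paper: the function name `lyapunov_exponent`, the initial point and the orbit lengths are assumptions, and the map parameter is called `lam` to avoid a clash with the exponent.

```python
import math


def lyapunov_exponent(lam, x0=0.3, transient=1_000, n=100_000):
    """Time average of log2|f'(x)| along an orbit of f(x) = lam*x*(1-x)."""
    x = x0
    for _ in range(transient):
        x = lam * x * (1.0 - x)
    total, count = 0.0, 0
    for _ in range(n):
        derivative = abs(lam * (1.0 - 2.0 * x))
        if derivative > 0.0:  # skip an exact (non-generic) hit of x = 1/2
            total += math.log2(derivative)
            count += 1
        x = lam * x * (1.0 - x)
    return total / count


print(lyapunov_exponent(3.2))  # stable period-2 orbit: negative
print(lyapunov_exponent(4.0))  # fully chaotic case: close to 1 bit per step
```

With the same function one can check that the finite-time estimate at the chaos threshold decays towards zero as the orbit length grows, which is the situation discussed next.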

The logistic map at the chaos threshold has null Kolmogorov entropy for
any invariant probability measure (this follows from the continuity of the topological
entropy and from the Variational Principle), and null Lyapounov exponent
(experimental results), but nevertheless numerical experiments show that
there is a power law divergence of nearby orbits. This feature of the logistic
map has motivated the application of a generalized version of the
thermodynamical entropy ([15], [?]). Formally, we have:

Definition 3.2. A dynamical system $f : X \to X$ has sensitivity to initial
conditions if there exists $\delta > 0$ such that, for all $x \in X$ and for all neighbourhoods
$U$ of $x$, there exist $y \in U$ and $n \in \mathbb{N}$ such that $d(f^n(x), f^n(y)) > \delta$.

Theorem 3.3. The logistic map at the chaos threshold has no sensitivity to
initial conditions. Indeed there exists a subset $X \subset [0, 1]$ with $m(X) = 1$,
for the Lebesgue measure $m$, such that for all $\delta > 0$ and for all $x \in X$, there
exists a neighbourhood $U(\delta)$ of $x$ such that $\forall y \in U(\delta)$ and $\forall n \in \mathbb{N}$ we have

$d(f^n_{\lambda_\infty}(x), f^n_{\lambda_\infty}(y)) < \delta$.
Proof. By Theorem 3.1 (6), almost every orbit, with
respect to the Lebesgue measure $m$, is eventually contained in $J^{(k)}$ for all $k$. Moreover, by
Theorem 3.1 (1), we have that $f_{\lambda_\infty}(J^{(k)}) \subset J^{(k)}$.

Let now $\delta > 0$ be fixed. Since $\Omega$ is a Cantor set, there exists $k_\delta$ such that
$\mathrm{diam}(J_i^{(k_\delta)}) < \delta$ for all $i = 1, \dots, 2^{k_\delta}$, where by $J_i^{(k_\delta)}$ we denote the intervals
making up $J^{(k_\delta)}$. We have (Theorem 3.1 (3)) that the action of $f_{\lambda_\infty}$ on $J^{(k_\delta)}$ is a
cyclic permutation of the $2^{k_\delta}$ intervals. Then if $x \in J_i^{(k_\delta)}$ for some $i$, and $U(\delta)$
is a neighbourhood of $x$ contained in $J_i^{(k_\delta)}$, then, for all $n$, $f^n_{\lambda_\infty}(U(\delta)) \subset J_j^{(k_\delta)}$
for some $j$. Hence $\forall y \in U(\delta)$ and $\forall n \in \mathbb{N}$, we have $d(f^n_{\lambda_\infty}(x), f^n_{\lambda_\infty}(y)) < \delta$.

Let now $X$ be the set of the points whose orbit is eventually contained
in $J^{(k_\delta)}$. By Theorem 3.1 (6), we have $m(X) = 1$. Let $x \in X$; then
there exists a neighbourhood $U(\delta)$ of $x$ such that, if $f^n_{\lambda_\infty}(x) \in J_i^{(k_\delta)}$ for some
$i$, then $f^n_{\lambda_\infty}(U(\delta)) \subset J_i^{(k_\delta)}$. The neighbourhood $U(\delta)$ exists by continuity of
the map $f_{\lambda_\infty}$. Now, we can repeat the same argument as before, showing
that $\forall y \in U(\delta)$ and $\forall n \in \mathbb{N}$, we have $d(f^n_{\lambda_\infty}(x), f^n_{\lambda_\infty}(y)) < \delta$ (if necessary it is
possible to shrink the neighbourhood $U(\delta)$ to have the previous relation for
all $n \in \mathbb{N}$). □

Hence the problem remains of explaining the power law divergence of
nearby orbits. One way to estimate the order of this divergence is
to consider the dimension of the set of points that remain close for some
iterations.

Definition 3.4. For $x \in [0, 1]$, we define

$B(x, n, \epsilon) = \{y \in [0, 1] \mid d(f^i_{\lambda_\infty}(x), f^i_{\lambda_\infty}(y)) < \epsilon \ \ \forall i \le n\}$.

Using this definition, we apply Theorem 3.3 to obtain

Corollary 3.5. For all $\epsilon > 0$ and for all $x \in X$, there exists $N(x, \epsilon) \in \mathbb{N}$
such that $B(x, n, \epsilon) = B(x, m, \epsilon)$ for all $n, m > N(x, \epsilon)$.

Hence, given a point $x \in X$ and $\epsilon$, the set $B(x, n, \epsilon)$ shrinks as $n$ increases,
until $n$ reaches the value $N(x, \epsilon)$. At this point the set $B(x, n, \epsilon)$ doesn't
change any more. But, if we are interested in estimating the order of divergence
of nearby orbits at the point $x$, we have to consider the limit

$B(x, n) = \lim_{\epsilon \to 0^+} B(x, n, \epsilon)$. (6)

We find that $\lim_{\epsilon \to 0^+} N(x, \epsilon) = +\infty$ for all $x$, hence the function $B(x, n)$
is a decreasing function of $n$, whose order gives information on the order of
the local divergence. In the next subsection we derive an estimate for the
AIC of a string generated by $f_{\lambda_\infty}$, and hence, by a theorem of [12], we find
the order of $B(x, n)$.

3.2 The AIC at the chaos threshold 

In this subsection we show an approach to the dynamics of the logistic map
at the chaos threshold, $f_{\lambda_\infty}$, that allows us to find an estimate for the AIC
(equation (1)) of almost every orbit generated by $f_{\lambda_\infty}$. This approach uses
the notion of kneading invariant. For a complete treatment of kneading
theory see, for example, [?].

Let $f : [0, 1] \to [0, 1]$ be a $C^1$ unimodal map, that is, there exists only
one point $c \in [0, 1]$ such that $f'(c) = 0$, and $f$ is increasing on $(0, c)$ and
decreasing on $(c, 1)$. We can then define the kneading sequence $k_f(x, t)$ of a
point $x \in [0, 1]$.

Definition 3.6. For $x \in [0, 1]$, we define coefficients $\theta_i(x)$ by:

$$\theta_i(x) = \begin{cases} +1 & \text{if } (f^{i+1})'(x) > 0, \\ -1 & \text{if } (f^{i+1})'(x) < 0, \\ \ \ 0 & \text{if } (f^{i+1})'(x) = 0. \end{cases}$$

Then the kneading sequence relative to $x$ is the formal power series

$k_f(x, t) = \sum_{i=0}^{\infty} \theta_i(x)\, t^i$.

Theorem 3.7. It is possible to introduce a distance $D$ on the space of kneading
sequences, given by

$D(k_f(x, t), k_f(y, t)) = \sum_{i=0}^{\infty} \frac{1}{2^i}\, |\theta_i(x) - \theta_i(y)|$.

This distance makes the function $x \mapsto k_f(x, t)$ continuous at all
$x \in [0, 1]$ except the preimages of the critical point $c$.
Definition 3.8. If $x$ is a preimage of the critical point $c$, we define

$k_f(x_\pm, t) = \lim_{y \to x^\pm} k_f(y, t)$,

where the limit is taken through sequences of points $y$ that are not preimages
of $c$. For the critical point $c$, it holds $k_f(c_-, t) = -k_f(c_+, t)$. Then we define
the kneading invariant $k_f(t)$ of the function $f$ as the sequence $k_f(c_-, t)$.

Using classical results on renormalization of maps with null topological
entropy ([?]), one can prove the following:

Theorem 3.9. The kneading invariant of the logistic map at the chaos
threshold $f_{\lambda_\infty}$ is given by

$k_{f_{\lambda_\infty}}(t) = \prod_{i=0}^{\infty} \left(1 - t^{2^i}\right)$. (7)

Let us now consider the partition $\alpha = (A_0, A_1)$, given by $A_0 = [0, 1/2]$
and $A_1 = (1/2, 1]$, of the interval $[0, 1]$ for the map $f_{\lambda_\infty}$. If we take a point
$x \in [0, 1]$, we can code its orbit into a string $s$ with $s_i \in \{0, 1\}$, according to
whether $f^i_{\lambda_\infty}(x)$ is in $A_0$ or in $A_1$.

From the definition of the coefficients $\theta_i(x)$, for any point $x$, we can see
that $\theta_i(x) = \mathrm{sgn}\left(f'(x) \cdots f'(f^i(x))\right)$. Hence, in the coding of the orbit of
$x$ into the string $s$, we have $s_i = 0$ if there is a permanence in the sign of
$\theta_{i-1}(x)$ and $\theta_i(x)$, $s_i = 1$ otherwise.

We denote by $s(x)$ the string generated by the orbit with initial condition
$x$. Let us start by studying the string generated by the point $x = 1/2$, that is the
critical point of $f_{\lambda_\infty}$. The coding of the orbit of 1/2 into a string $s$ is related
to the kneading invariant of the map, $k_{f_{\lambda_\infty}}(t)$, given by Theorem 3.9. Thanks
to the particular form of the kneading invariant (equation (7)), we obtain that
to reconstruct the string $s$, it is sufficient to perform the following operations:

• the first symbol is 0;

• given $s^n$, that is the first $n$ symbols of the string, we construct $s^{2n}$ by
$s^n \hat{s}^n$, where $\hat{s}^n$ is equal to $s^n$ if $n = 2^{2k+1}$, otherwise $\hat{s}^n$ is obtained from $s^n$
by changing the first symbol.
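The doubling construction can be compared with a direct numerical coding of the orbit of 1/2. The Python sketch below is an illustration only. It assumes the following reading of the rule: when the current length is $2^j$, the appended copy has its first symbol changed if $j$ is even and is an exact copy if $j$ is odd. The function names are hypothetical, and the comparison is limited to 64 symbols so that floating-point iterates stay well away from the boundary point 1/2.

```python
LAMBDA_INF = 3.569945671870944901842  # chaos threshold of the logistic map


def itinerary_of_critical_point(n, lam=LAMBDA_INF):
    """First n symbols of the orbit of 1/2: 0 on [0, 1/2], 1 on (1/2, 1]."""
    x, symbols = 0.5, []
    for _ in range(n):
        symbols.append(0 if x <= 0.5 else 1)
        x = lam * x * (1.0 - x)
    return symbols


def doubling_string(levels):
    """Build the string of length 2**levels by repeated doubling.

    Assumed rule: at length 2**j the appended copy has its first symbol
    flipped when j is even, and is unchanged when j is odd.
    """
    s = [0]
    for j in range(levels):
        copy = list(s)
        if j % 2 == 0:
            copy[0] = 1 - copy[0]
        s = s + copy
    return s


s64 = doubling_string(6)
print("".join(map(str, s64[:16])))
print(s64 == itinerary_of_critical_point(64))
```

Since the construction needs only the number of doubling steps as input, a description of the first $n$ symbols requires about $\log n$ bits, which is the mechanism behind the estimate that follows.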

Hence we have that $I_{AIC}(s^n(1/2)) = \log n$; indeed, it is enough to specify
the length of the string $s^n$. Moreover, by Theorem 3.1 (1), we have that
$1/2 \in \Omega$, the attractor of $f_{\lambda_\infty}$, and the orbit of 1/2 is dense in $\Omega$ (Theorem
3.1 (4)). Hence for all $x \in X$ (see Theorem 3.3), there exists $N(x)$ such that
for all $i > N(x)$, $s_i(x)$ can be obtained from $s_j(1/2)$. This implies the following
result:

Theorem 3.10. For almost any point $x \in [0, 1]$, the AIC of the string $s(x)$
generated by the orbit of the map $f_{\lambda_\infty}$ with $x$ as initial condition is such that

$I_{AIC}(s^n(x)) = \log n + \text{const}$, (8)

where the constant depends on the point $x$.

Remark 3.11. We remark that we have actually proved only the behaviour
of the AIC for the partition $\alpha$. Indeed, to obtain the AIC of the orbits of our
dynamical system we should consider the supremum over either all the open
covers or the computable partitions (see Section 2). But $\alpha$ is
a generating partition, that is, $\lim_{n \to \infty} \mathrm{diam}(\alpha^n) = 0$, where $\mathrm{diam}(\alpha)$ is the
diameter of a partition, and $\alpha^n = \alpha \wedge \dots \wedge f^{-n}(\alpha)$. Hence we suppose that
the AIC we estimated is a good approximation of the real one, following the
same arguments used, for example, for the Kolmogorov-Sinai entropy.

We are now ready to give an estimate on the behaviour of the function
$B(x, n)$ defined in equation (6). Indeed, using Theorem 40 in [12], we have:

Corollary 3.12. For almost any point $x \in [0, 1]$, the function $B(x, n)$ is of
the order of $n^{-k}$, for some constant $k > 0$. Hence, we can say that we have
a power law divergence for nearby orbits, for almost any orbit.

This result confirms the experiments made on the logistic map at the
chaos threshold. The constant $k$ has been found to be approximately 1.3236
([?]).

We have thus derived an estimate for the behaviour of the Algorithmic
Information Content for the logistic map at the chaos threshold, showing its
connection with the sensitivity to initial conditions. We remark that the AIC
is not a computable function, that is, there is no algorithm able to
compute the AIC of any string. In the next section, we show how to obtain
an estimate of the AIC for our map.



4 Compression Algorithms 



In Section 2 we have introduced the notion of Algorithmic Information Content
(AIC) of a finite string $s$, following the work of Chaitin ([?]) and
Kolmogorov ([13]). Since the AIC is not computable, it has been
approximated by other notions of information content of a string that can
be computed.

One measure of the information content of a finite string built on an
alphabet $A$ can be defined by a lossless (reversible) data compression algorithm

$Z : \Sigma(A) \to \Sigma(\{0, 1\})$,

that is, a coding procedure such that from the coded string we can reconstruct
the original string. Hence it is natural to consider the length of the coded
string as an approximate measure of the quantity of information that is
contained in the original string. We then define the information content
$I_Z(s)$ of the string $s$ as

$I_Z(s) = |Z(s)|$, (9)

where $|Z(s)|$ denotes the length of the compressed string $Z(s)$. The information
content $I_Z(s)$ turns out to be a computable function and for this reason
we will call it Computable Information Content (CIC). For a more accurate
discussion on compression algorithms and the CIC we refer to [?].
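Definition (9) can be tried out with any lossless compressor. The sketch below uses zlib (DEFLATE) from the Python standard library as a stand-in for $Z$; this is an assumption made only for illustration, since zlib is not the algorithm studied in this paper, and the name `cic_bits` is hypothetical.

```python
import random
import zlib


def cic_bits(data: bytes) -> int:
    """Length in bits of `data` after zlib compression at level 9."""
    return 8 * len(zlib.compress(data, 9))


# A periodic string and a pseudo-random string over the alphabet {0, 1}.
periodic = b"01" * 5000
rng = random.Random(0)
noisy = bytes(rng.choice(b"01") for _ in range(10000))

print(cic_bits(periodic), cic_bits(noisy))
```

The periodic string compresses to far fewer bits than the pseudo-random one, which is the qualitative behaviour that a computable information content is expected to capture.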

4.1 A new compression algorithm: CASToRe 

In [1] and [?], a new compression algorithm was presented, called
CASToRe, which was created in order to give information on null entropy
dynamics and has been used to study a case of sporadic dynamics, the
Manneville map. In the following, the algorithm CASToRe will be used to
evaluate the Computable Information Content $I_Z$ of the strings (equation
(9)). As will be shown later in the case of regular strings, this new
algorithm is a sensitive measure of the information content of the string.
That is why it is called CASToRe: Compression Algorithm, Sensitive To
Regularity. In particular, in [?] the algorithm CASToRe has been tested on
fully chaotic and sporadic strings, confirming the theoretical results.

We now give a short description of the algorithm. CASToRe is a compression
algorithm which is a modification of the LZ78 algorithm ([14]).
As will be proved in Theorem 4.1, the information $I_Z$ of a constant
sequence of length $n$ is $4 + 2\log_2(n + 1)[\log_2(\log_2(n + 1)) - 1]$
if the algorithm $Z$ is CASToRe. The theory predicts that the best possible
information is $I_{AIC} = \log(n) + \text{const}$. In [1], it is shown that the algorithm
LZ78 encodes a constant sequence $n$ digits long to a string with length about
$\text{const} + n^{1/2}$ bits; so we cannot expect that LZ78 is able to distinguish a
sequence whose information grows like $n^\alpha$ ($\alpha < \frac{1}{2}$) (sporadic dynamics) from
a constant or periodic one, while it has been proved ([?]) that CASToRe is a
very useful tool to identify the correct exponent in the sporadic case.

The algorithm CASToRe is based on an adaptive dictionary. Roughly 
speaking, this means that it translates an input stream of symbols (the file 
we want to compress) into an output stream of numbers, and that it is 
possible to reconstruct the input stream knowing the correspondence between 
output and input symbols. This unique correspondence between sequences 
of symbols (words) and numbers is called the dictionary. 

At the beginning of the encoding procedure, the dictionary is empty. In order
to explain the principle of encoding, let us consider a point within the encoding
process, when the dictionary already contains some words.

We start analyzing the stream, looking for the longest word W in the
dictionary matching the stream. Then we look for the longest word Y in
the dictionary such that W + Y matches the stream. Suppose that we are
compressing an English text, and the stream contains "basketball"; we
may have the words "basket" (number 119) and "ball" (number 12) already
in the dictionary, and they would of course match the stream.

The output from this algorithm is a sequence of word-word pairs (W, Y),
or rather their numbers in the dictionary, in our case (119, 12). The resulting
word "basketball" is then added to the dictionary, so each time a pair is
output to the code-stream, the string from the dictionary corresponding to
W is extended with the word Y and the resulting string is added to the
dictionary.
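The description above can be turned into a small Python sketch. It is not the authors' implementation, and it makes two assumptions that the text does not fix: a symbol that matches no dictionary word is emitted as the pair (0, symbol), and when no dictionary word matches after W the pair is (W, 0) and no new word is stored. The function names are hypothetical, and the search for the longest match is a plain scan, adequate only for short strings.

```python
def castore_encode(stream):
    """Encode `stream` as pairs (W, Y) of 1-based dictionary numbers."""
    words = []   # words[i] has dictionary number i + 1
    index = {}   # word -> dictionary number
    pairs = []
    pos = 0

    def longest_match(start):
        best = ""
        for w in words:
            if len(w) > len(best) and stream.startswith(w, start):
                best = w
        return best

    while pos < len(stream):
        w = longest_match(pos)
        if not w:                      # assumption: unseen symbol -> (0, symbol)
            new_word = stream[pos]
            pairs.append((0, new_word))
        else:
            y = longest_match(pos + len(w))
            pairs.append((index[w], index[y] if y else 0))
            new_word = w + y
        if new_word not in index:
            words.append(new_word)
            index[new_word] = len(words)
        pos += len(new_word)
    return pairs, words


def castore_decode(pairs):
    """Rebuild the stream from the pairs, mirroring the dictionary updates."""
    words, out = [], []
    for w, y in pairs:
        new_word = y if w == 0 else words[w - 1] + (words[y - 1] if y else "")
        out.append(new_word)
        if new_word not in words:
            words.append(new_word)
    return "".join(out)


pairs, words = castore_encode("a" * 15)
print(pairs)
print(words)
```

On a constant string the dictionary words double in length at each step, which is the behaviour analysed in the next subsection.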

In the following subsections, we will estimate the information content
$I_Z$, where the algorithm $Z$ is CASToRe, in the case of constant sequences
and periodic sequences with period $p > 1$. Indeed this is important for the
application of CASToRe to the logistic map at the chaos threshold (Section 5).
4.2 CASToRe on constant strings

We first study the case of period $p = 1$ (constant strings). If we have a string
given by

$s = (aaaaaaaaaaaaaaa \dots)$, (10)

the algorithm CASToRe gives a codification

1 : (0, a) [a]
2 : (1, 1) [aa]
3 : (2, 2) [aaaa]
4 : (3, 3) [aaaaaaaa]
...
k : (k - 1, k - 1) [a ... a]

so if we compute the information $I_Z(s^n)$ as a function of the length $n$ of the
string to be compressed, we can immediately deduce that $I_Z(s^n) \sim O(\log_2 n)$.
Indeed a much more accurate estimate of the CIC for constant strings is
obtained in the following theorem.



Theorem 4.1. Let $s$ be a constant string as in (10); then the information
function $I_Z(s^n)$ obtained by applying the compression algorithm CASToRe is
approximated by the function $\Psi(n)$,

$\Psi(n) = 4 + 2 \log_2(n + 1) \left(\log_2(\log_2(n + 1)) - 1\right)$, (12)

where the approximation is given by the identification of the integer part of
$\log_2(\log_2(n))$ with its real value.

Proof. We notice that the algorithm CASToRe acts on $s$ in such a way that
at the end of each term of the codification, the length $n$ of the codified string
is of the form $n = 2^h - 1$. Then we look for the value of $I(2^h - 1)$. We can
easily obtain that if $k = [\log_2 h]$, where $[\cdot]$ denotes the integer part, then

$$\begin{aligned} I(2^h - 1) &= 2 + \sum_{i=0}^{k-1} 2\,(2^{i+1} - 2^i) \log_2(2^{i+1}) + 2(h - 2^k) \log_2(2^{k+1}) \\ &= 2 + \sum_{i=0}^{k-1} (i + 1)\, 2^{i+1} + 2(h - 2^k) \log_2(2^{k+1}) \\ &= 4 + (k - 1)\, 2^{k+1} + 2(h - 2^k) \log_2(2^{k+1}), \end{aligned} \qquad (13)$$

where we used

$\sum_{j=1}^{r} j\, 2^j = (r - 1)\, 2^{r+1} + 2$.

At this point it is enough to approximate $k$ with its real value $\log_2 h$, and
substitute for $h$ its value $\log_2(n + 1)$ to obtain equation (12). The theorem is
proved. □
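As a worked check, the two expressions can be evaluated numerically. The Python sketch below is an illustration with hypothetical function names; it reads the last term of equation (13) with the number of dictionary words $h$ (so that $n = 2^h - 1$), which is an assumption about the intended variable. The two functions agree exactly when $h$ is a power of two, where the integer part of $\log_2 h$ equals its real value.

```python
import math


def psi(n):
    """Equation (12): 4 + 2*log2(n+1)*(log2(log2(n+1)) - 1)."""
    h = math.log2(n + 1)
    return 4 + 2 * h * (math.log2(h) - 1)


def info_exact(h):
    """Last line of equation (13) for n = 2**h - 1, with k = [log2 h]."""
    k = h.bit_length() - 1          # integer part of log2(h) for h >= 1
    return 4 + (k - 1) * 2 ** (k + 1) + 2 * (h - 2 ** k) * (k + 1)


for h in (1, 2, 3, 4, 8):
    n = 2 ** h - 1
    print(h, n, info_exact(h), round(psi(n), 3))
```

For example, a constant string of length 15 corresponds to $h = 4$ and both expressions give 12 bits.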



4.3 CASToRe on periodic strings 

We now look at strings $s$ with prime period $p > 1$. From the point of
view of the information content, we expect that periodic strings are not so
different from constant strings. Indeed we find that the algorithm CASToRe
is able to recognize periodic strings as constant strings after having encoded
$n_p$ symbols, with $n_p$ depending on $p$. Then the CIC $I_Z(s^n)$ must behave as in the
constant case for $n > n_p$; indeed, it is given by $\Psi(n)$ plus a constant term
$C_s$ that depends on the particular string $s$. This term $C_s$ can be estimated
as the amount of information obtained from $s$ at the $n_p$-th symbol. We first
prove the following theorem.

Theorem 4.2. Let $s$ be a periodic string with prime period $p > 1$; then there
exists a constant $n_p$, depending on $s$, such that, when the algorithm CASToRe
reaches the $n_p$-th term of the string, it starts codifying the string $s$ as if it
were a constant string. The constant $n_p$ has as upper bound

$K_p = p^2 \left[ 2 + \frac{3}{2} \log_2 p + \frac{1}{2} \log_2 |A| \right]$, (14)

where $|A|$ is the number of symbols in the alphabet used to construct the string
$s$. Note that, for a fixed alphabet, $K_p$ depends only on the period $p$.

Proof. Let us assume that $|A| = r$. Since $s$ has prime period $p$, we can look
at it as a sequence of substrings, each one $p$ symbols long; they will be called
$p$-substrings.

Once the codification has started and the first $p$ characters have been
read, the last encoded word will be $t$ symbols long and will end at some
$k$-th (mod $p$) site in the second $p$-substring. Then, after at most $(p + 1)$
subsequent readings of contiguous $p$-substrings, the algorithm will arrive at
the end of an already known word; this word will finish at some $h$-th (mod
$p$) site of the $(p + 2)$-th $p$-substring of $s$.

Let us assume that the number of words needed by the algorithm to
codify the fragment starting from the $h$-th symbol of a previous $p$-substring
to its correspondent belonging to the $(p + 2)$-th $p$-substring of $s$ is $m$; we will
indicate the collection of those $m$ words by $\mathcal{F}(h)$. Let us analyse the different
cases.

First case: $m = 2^c$. The algorithm continues to process the remaining
part of $s$, codifying the next fragment $\mathcal{F}(h)$ by coupling two words each time.
So, after $c$ steps only one word will be necessary and later the algorithm will
follow the period.

Second case: $m = 2^{c_1} q$, where $q$ is odd. The algorithm first behaves $c_1$
times as described above, after which just $q$ words will be needed to process the
following $\mathcal{F}(h)$. Let us call $A_1, \dots, A_q$ those known words. After one step, the
algorithm CASToRe will have built the new words $B_1 = (A_1, A_2), \dots, B_{\frac{q-1}{2}} =
(A_{q-2}, A_{q-1})$, and $B_{\frac{q-1}{2}+1} = (A_q, B_1)$. Then, the algorithm will reach the
period after $c_2$ steps, where $c_2 = \min\{c \in \mathbb{N} : 2^c > q + 1\}$.

Third case: $m$ is odd. The algorithm will behave as in the second case,
with $c_1 = 0$.

So we have proved the existence of the constant $n_p$. Let us estimate its
upper bound, $K_p$. It holds:

$$n_p < p(p + 2)[1 + C]$$

where $C = \min\{c \in \mathbb{N} : 2^c > m + 1\}$. Now, we estimate $m$. It holds
$m < (p + 2) \times S$, where $S$ is the number of words necessary to cover
a $p$-substring. Since $r$ different symbols are available in the alphabet $A$,
it must be $S < N$, where $N$ is the smallest integer such that it holds

[...]

Since we have

[...]

and we are interested in an estimation of $N$, let us look for the smallest
integer $N$ such that

[...]

Let $N$ be the real solution of the following equation:

[...]



Then the integer $N$ we looked for is $O(\sqrt{rp})$ and it holds $m < (p + 2)\sqrt{rp}$.
So, we have $C \le 1 + \log_2(m + 1)$. Finally, if we approximate $p + 2$ with $p$,
we obtain

$$K_p < p^2 \left[ 2 + \frac{3}{2} \log_2 p + \frac{1}{2} \log_2 r \right]$$



and the theorem is proved. 



□ 
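
To get a feeling for the size of the bound, the following sketch evaluates it numerically. It assumes the final estimate of the proof reads $K_p < p^2 [2 + \frac{3}{2}\log_2 p + \frac{1}{2}\log_2 r]$; the function name `period_bound` is hypothetical and the chosen values of $p$ and $r$ are only examples.

```python
import math


def period_bound(p: int, r: int) -> float:
    """Right-hand side p**2 * (2 + 1.5*log2(p) + 0.5*log2(r)) of the bound.

    p is the period of the string, r the number of symbols in the alphabet.
    """
    if p < 1 or r < 1:
        raise ValueError("p and r must be positive integers")
    return p ** 2 * (2 + 1.5 * math.log2(p) + 0.5 * math.log2(r))


# Example values only: a binary alphabet and periods that are powers of two.
for p in (2, 16, 256, 4096):
    print(p, period_bound(p, 2))
```

The bound grows like $p^2 \log_2 p$, so for binary strings of period $2^{12}$ it already exceeds $10^8$.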



5 Experimental results on the logistic map 

In this section we show how to obtain an estimation for the Computable
Information Content $I_Z(s^n)$ for a finite string $s^n$ generated by the logistic
map at the chaos threshold, using CASToRe as compression algorithm $Z$. In
the remainder of this section we drop the subscript $Z$ from the CIC, since
we are always referring to CASToRe.

The plan of our experiments on the logistic map is to apply CASToRe to
strings generated by the logistic map $f_\lambda$ for different values of $\lambda$ that
approximate the chaos threshold $\lambda_\infty$. In particular we consider two approximating
sequences of values of $\lambda$: the sequence of values $\lambda_j < \lambda_\infty$ at which the period
doubling bifurcations occur (Section 5.1) and the sequence of values $\mu_k > \lambda_\infty$
at which the inverse tangent bifurcations occur (Section 5.2). Thus we obtain
approximations for the CIC at the chaos threshold from below and from
above, and the two coincide. We can then say that we have obtained an
approximation for the CIC computed with CASToRe for the logistic map at the
chaos threshold $f_{\lambda_\infty}$.

We remark that the symbolic strings are obtained from the orbits of the
logistic map at the chaos threshold considering the partition $\alpha = (A_0, A_1)$ of
$[0, 1]$ given by $A_0 = [0, 1/2)$ and $A_1 = (1/2, 1]$. The choice of this partition
for the experiments is justified by its optimality as remarked in Section 3.2.
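
A minimal sketch of how such a symbolic string can be produced is given below. It assumes the map $f_\lambda(x) = \lambda x (1 - x)$, uses the commonly quoted approximation $\lambda_\infty \approx 3.569945672$ (a value not taken from this section), works in ordinary double precision rather than the higher precision used in the experiments, and assigns the point $x = 1/2$ to the symbol 1 by convention. The name `symbolic_orbit` is hypothetical.

```python
LAMBDA_INF = 3.569945672  # assumed approximation of the chaos threshold


def symbolic_orbit(lam: float, x0: float, n: int) -> list:
    """Return the first n symbols of the orbit of x0 under x -> lam*x*(1-x).

    Symbol 0 is emitted when x lies in [0, 1/2), symbol 1 otherwise
    (the boundary point 1/2 is assigned to 1, an assumption of this sketch).
    """
    symbols = []
    x = x0
    for _ in range(n):
        symbols.append(0 if x < 0.5 else 1)
        x = lam * x * (1.0 - x)
    return symbols


if __name__ == "__main__":
    s = symbolic_orbit(LAMBDA_INF, 0.3, 50)
    print("".join(str(b) for b in s))
```

The resulting list of 0s and 1s is the kind of string that is then passed to the compression algorithm.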



5.1 The period doubling sequence 



In Section 4.3 we have analysed the behaviour of our algorithm CASToRe on
periodic strings, obtaining that the CIC $I(s)$ of a periodic string $s$ behaves
with respect to the length $n$ of the encoded string as

$$I(s^n) = \Psi(n) + C_s, \tag{15}$$

where $\Psi(n)$ is the information function obtained for constant strings and $C_s$ is
a constant depending on the string $s$ (Theorem 4.2).

We start by considering the logistic map for values of the parameter $\lambda$ in the
sequence given by the period doubling bifurcations, for which values $\lambda_j$
the logistic map is periodic. The sequence $(\lambda_j)_{j \in \mathbb{N}}$ converges to the chaos
threshold value $\lambda_\infty$ from below and can be generated by the equation given in
Section 3.1.

We generated the terms of the sequence $(\lambda_j)_{j \in \mathbb{N}}$ (where the differences are
given up to 38 decimal digits) and iterated the logistic map $10^7$ times, so that
we had strings of $10^7$ symbols. When studying the CIC $I_j(s^n)$ for each $\lambda_j$,
where $s^n$ is the symbolic orbit of the logistic map with parameter $\lambda = \lambda_j$, we
first tried to obtain the constant terms $C_j$, as expected from equation (15).
We have obtained a sequence of values $C_j$ that increases with $j$,
and our results about the compression of periodic strings by the
algorithm CASToRe (see Theorem 4.2) are supported by the computations
of the following limits

$$\lim_{n \to \infty} \frac{I_j(s^n) - C_j}{\Psi(n)} = 1 \qquad \forall j \in \mathbb{N} \tag{16}$$



As expected, equation (16) tells us that, after the compression of a long
enough substring of our string $s$, we obtain the same information function as
for constant strings, simply translated by a constant. If we denote by $n_j$ the
number of symbols that the algorithm has to process before reaching that
behaviour, we have $n_j \to \infty$ as $j \to \infty$. So we have that $C_j \to \infty$ as $j \to \infty$.

Figure 1: On the left one can see the behaviour of some of the functions
$S_j(n)$ (dashed lines), which shows the monotonicity of these functions. The
solid line is the function $S_\infty(n)$ and one can see that it is an upper bound
for the functions $S_j(n)$. On the right there is the same picture but for the
functions $S_k(n)$ that approximate $S_\infty(n)$ from above.

The approximation of the information content $I_j(s^n)$ by $\Psi(n)$ plus a
constant term, as in equation (15), is not very good for periodic
strings with high period. This is because of the weakness of the upper bound
in our computations: indeed, we know that this approximation is accurate for
values of $n > n_j$ and there are strings whose $n_j$ is bigger than $10^7$, the length
of our orbits. In our case the period $p_j$ of the strings $s_j$ is already big enough
for $j = 12$: indeed $p_{12} = 2^{12} = 4096$, so $n_{12} \approx 16 \times 10^6$. This means that we
have to look for another way of approximation to the information function
for values of $n$ lower than $n_j$. At the bifurcation points it is well known from
experimental results that the orbit separation is polynomial and, thanks to
the results in [12], we can say that the Algorithmic Information Content is at
most logarithmic. So, we can expect our Computable Information Content
to behave like

$$I_j(s^n) \sim \begin{cases} S_j(n)\,\Psi(n) & \text{for } n < n_j \\ C_j + \Psi(n) & \text{for } n > n_j \end{cases}$$

with $C_j$ as given before. To find the functions $S_j(n)$, we simply have to
compute the fraction

$$S_j(n) = \frac{I_j(s^n)}{\Psi(n)}, \qquad j \in \mathbb{N}. \tag{18}$$

First of all, we deduce from equation (16) that $\lim_{n \to \infty} S_j(n) = 1$ for all
$j$. Moreover the numerical experiments show that the sequence of functions
$(S_j(n))_{j \in \mathbb{N}}$ is point-wise increasing in $j$ and bounded from above (see figure
1 on the left). Then we define the bounding function $S_\infty(n)$ as

$$S_\infty(n) = \lim_{j \to \infty} \left( S_j(n)\,\chi_{[0,n_j]}(n) + \chi_{[n_j,\infty]}(n) \right)$$

for each $n \in \mathbb{N}$, where $\chi_{[a,b]}(n)$ denotes the characteristic function of the real
interval $[a, b]$.



5.2 Inverse tangent bifurcations

In order to establish the behaviour of the information function $I_\infty(s^n)$ at
the chaos threshold, we applied the same method as before, building up a
sequence $(\mu_k)_{k \in \mathbb{N}}$ of parameters approximating $\lambda_\infty$ from above.

In the interval $[\lambda_\infty, 4]$, the logistic map has a general chaotic behaviour,
except for narrow ranges (called periodic windows) of parameters for which
the map is periodic. Inside each window, a new period doubling sequence can 
be identified, that leads the map to chaos. The behaviour becomes periodic 
from chaotic via a tangent bifurcation, that is we can find a fixed point of 
a given iterate of the map having 1 as its eigenvalue (0). The values of 
the parameter at which the tangent bifurcations occur converge to $\lambda_\infty$ from
above and can be generated using the following relation:

$$\mu_k = \lambda_\infty + \frac{c}{\delta^k}$$

where $c$ is a suitable constant and $\delta$ is the Feigenbaum constant.
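
The relation above can be sketched numerically as follows. The constant $c$ is not specified in the text, so the value used here is purely a placeholder; the numerical values of $\lambda_\infty$ and of the Feigenbaum constant are the commonly quoted approximations and are assumptions of this sketch, as is the function name `tangent_parameters`.

```python
LAMBDA_INF = 3.569945672        # assumed approximation of the chaos threshold
FEIGENBAUM_DELTA = 4.669201609  # assumed approximation of the Feigenbaum constant


def tangent_parameters(c: float, count: int) -> list:
    """Return mu_k = LAMBDA_INF + c / FEIGENBAUM_DELTA**k for k = 1..count."""
    return [LAMBDA_INF + c / FEIGENBAUM_DELTA ** k for k in range(1, count + 1)]


# c = 1.0 is a placeholder, not the constant used in the experiments.
for k, mu in enumerate(tangent_parameters(1.0, 8), start=1):
    print(k, mu)
```

With any positive $c$ the values decrease geometrically towards $\lambda_\infty$, each gap being smaller than the previous one by a factor $\delta$.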

Using the same definition of $S_k(n)$ given in equation (18), we find a
sequence of functions such that $\lim_{n \to \infty} S_k(n) = +\infty$ for all $k \in \mathbb{N}$, due to the
chaotic behaviour of the map at $\mu_k$.

As we can see in figure 1 on the right, the numerical evidence is that
the sequence $(S_k(n))_{k \in \mathbb{N}}$ is point-wise decreasing in $k$ and the same function
$S_\infty(n)$ found in the previous subsection is a lower bound for the sequence.
Then we have

$$\lim_{k \to \infty} S_k(n) = S_\infty(n)$$

for each $n$.



5.3 Results 

From the previous subsections we can draw some conclusions. First of all,
we can say that for all $n \in \mathbb{N}$ the information function $I_j(s^n)$ of the logistic
map $f_\lambda$ with parameter $\lambda = \lambda_j$ (as shown in Section 5.1) is

$$I_j(s^n) \sim S_j(n)\,\Psi(n)\,\chi_{[0,n_j]}(n) + (\Psi(n) + C_j)\,\chi_{[n_j,\infty]}(n) \qquad \forall j \in \mathbb{N},$$

and with parameter $\lambda = \mu_k$ (as shown in subsection 5.2) is

$$I_k(s^n) \sim S_k(n)\,\Psi(n) \qquad \forall k \in \mathbb{N}.$$

Moreover we have that

$$\lim_{j \to \infty} I_j(n) = \lim_{k \to \infty} I_k(n) = I_\infty(n) = S_\infty(n)\,\Psi(n) \qquad \forall n \in \mathbb{N}.$$

From the numerical approximation of $S_\infty(n)$ from above and below, we
conjecture that $S_\infty(n)$ is an increasing bounded function of $n$ and there exists
a constant $S_\infty = \lim_{n \to \infty} S_\infty(n)$.



Figure 2: The solid line is the limit function $S_\infty(n)$, and dashed lines are the
approximating functions $S_i(n)$ from above and below. Horizontal axis: encoded
symbols, from $10^6$ to $10^7$; vertical axis ticks from 1.5 to 4.5.

Numerical estimates give a value of $S_\infty$ of approximately 3.5. In figure 2
the behaviour of functions $S_i(n)$ corresponding to some values of both the
approximating sequences is shown.

We have thus confirmed the theoretical result of Theorem 3.10, proving
that the logistic map at the chaos threshold is mildly chaotic (see Section 0). 

Finally we remark that the feature of $I_\infty(s^n)$ and in particular of $S_\infty(n)$
is typical of the logistic map, and is not simply due to the fact that we are
considering periodic orbits with period going to infinity. Indeed, from the
behaviour of the function $S_\infty(n)$, we deduce that $\max_n S_\infty(n) < 3.7$ and this
means that for $n = 10^3$, for example, $I_\infty(s^n)$ is less than approximately 148.
But we can construct periodic strings with period less than 100, such that
the information after $10^3$ symbols is more than 148. We can consider, for
example, a string with period given by all the possible combinations of two
symbols in pieces of length from 1 to 4. This string has period $98 \ll 2^7$,
but $I(734) = 356$, much more than 148.
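
The periodic string of this example can be built as in the following sketch. The order in which the pieces are concatenated is not specified in the text, so lexicographic order is an assumption here, as are the helper names `all_words_block` and `smallest_period`. The sketch only constructs the string and checks its period; it does not compute its information content, which requires CASToRe.

```python
from itertools import product


def all_words_block(alphabet: str = "01", max_len: int = 4) -> str:
    """Concatenate every word over `alphabet` of length 1..max_len.

    Words are taken in lexicographic order within each length
    (an assumption of this sketch).
    """
    pieces = []
    for length in range(1, max_len + 1):
        for word in product(alphabet, repeat=length):
            pieces.append("".join(word))
    return "".join(pieces)


def smallest_period(block: str) -> int:
    """Smallest d dividing len(block) such that block is a repetition of block[:d]."""
    n = len(block)
    for d in range(1, n + 1):
        if n % d == 0 and block[:d] * (n // d) == block:
            return d
    return n


block = all_words_block()                 # 2*1 + 4*2 + 8*3 + 16*4 = 98 symbols
periodic_string = (block * 11)[:1000]     # first 10^3 symbols of the periodic string
print(len(block), smallest_period(block))
```

The block has $2 \cdot 1 + 4 \cdot 2 + 8 \cdot 3 + 16 \cdot 4 = 98$ symbols and is not a repetition of a shorter word, so the string obtained by repeating it has period 98.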